Skip to content

examples : update vad support in stream example #3160

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

danbev
Copy link
Collaborator

@danbev danbev commented May 15, 2025

This commit updates the stream example to use the VAD support in whisper instead of the simple_vad that it currently uses.


After first trying to just use the current VAD support that is "built-in" when calling whisper_full it became difficult to update this example. As a suggestion I've exposed a whisper_vad function which allows the VAD processing to be called separately and then if there if speech is detected the vad_samples that are returned can be used to with whisper_full but this time there is no need to processing VAD so params.vad should be set to false.

The built-in VAD works well for sample files but does not seem to be optimal for streaming, at least that is my feeling after trying to update this example.

@danbev danbev force-pushed the stream-example-vad-update branch from 950d7c5 to b924706 Compare May 16, 2025 13:30
@danbev danbev force-pushed the stream-example-vad-update branch from 654f1b4 to ce75a8e Compare June 2, 2025 04:15
danbev added 2 commits June 9, 2025 12:08
This commit updates the stream example to use the new Voice Activity
Detection (VAD) support instead of the simple_vad which it currently
used.
@danbev danbev force-pushed the stream-example-vad-update branch 2 times, most recently from 06395a6 to c8f8b4d Compare June 9, 2025 10:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant